Search Results

Stanford CS25: V1 I Transformer Circuits, Induction Heads, In-Context Learning

Stanford CS25: V1 I Transformer Circuits, Induction Heads, In-Context Learning

A Walkthrough of In-Context Learning and Induction Heads Part 1 of 2 (w/ Charles Frye)

A Walkthrough of In-Context Learning and Induction Heads Part 1 of 2 (w/ Charles Frye)

Stanford CS25: V1 I Decision Transformer: Reinforcement Learning via Sequence Modeling

Stanford CS25: V1 I Decision Transformer: Reinforcement Learning via Sequence Modeling

Catherine Olsson - Induction Heads

Catherine Olsson - Induction Heads

Stanford CS25: V1 I Self Attention and Non-parametric transformers (NPTs)

Stanford CS25: V1 I Self Attention and Non-parametric transformers (NPTs)

Understanding ICL: Induction Heads (Natural Language Processing at UT Austin)

Understanding ICL: Induction Heads (Natural Language Processing at UT Austin)

SLT Summit 2023 - Induction Heads and Phase Transitions (Mech Interp 2)

SLT Summit 2023 - Induction Heads and Phase Transitions (Mech Interp 2)

EleutherAI Interpretability Reading Group 220423: In-context learning and induction heads

EleutherAI Interpretability Reading Group 220423: In-context learning and induction heads

Stanford CS25: V1 I Transformers in Language: The development of GPT Models, GPT3

Stanford CS25: V1 I Transformers in Language: The development of GPT Models, GPT3

A Walkthrough of A Mathematical Framework for Transformer Circuits

A Walkthrough of A Mathematical Framework for Transformer Circuits

Attention - General - Copying & Induction heads [rough early thoughts]

Attention - General - Copying & Induction heads [rough early thoughts]

Stanford CS25: V1 I DeepMind's Perceiver and Perceiver IO: new data family architecture

Stanford CS25: V1 I DeepMind's Perceiver and Perceiver IO: new data family architecture